29,353 research outputs found
Understanding structure of concurrent actions
Whereas most work in reinforcement learning (RL) ignores the structure or relationships between actions, in this paper we show that exploiting structure in the action space can improve sample efficiency during exploration. To show this we focus on concurrent action spaces where the RL agent selects multiple actions per timestep. Concurrent action spaces are challenging to learn in especially if the number of actions is large as this can lead to a combinatorial explosion of the action space.
This paper proposes two methods: a first approach uses implicit structure to perform high-level action elimination using task-invariant actions; a second approach looks for more explicit structure in the form of action clusters. Both methods are context-free, focusing only on an analysis of the action space and show a significant improvement in policy convergence times
True Neutrality as a New Type of Flavour
A classification of leptonic currents with respect to C-operation requires
the separation of elementary particles into the two classes of vector C-even
and axial-vector C-odd character. Their nature has been created so that to each
type of lepton corresponds a kind of neutrino. Such pairs are united in
families of a different C-parity. Unlike the neutrino of a vector type, any
C-noninvariant Dirac neutrino must have his Majorana neutrino. They constitute
the purely neutrino families. We discuss the nature of a corresponding
mechanism responsible for the availability in all types of axial-vector
particles of a kind of flavour which distinguishes each of them from others by
a true charge characterized by a quantum number conserved at the interactions
between the C-odd fermion and the field of emission of the corresponding types
of gauge bosons. This regularity expresses the unidenticality of truly neutral
neutrino and antineutrino, confirming that an internal symmetry of a
C-noninvariant particle is described by an axial-vector space. Thereby, a true
flavour together with the earlier known lepton flavour predicts the existence
of leptonic strings and their birth in single and double beta decays as a unity
of flavour and gauge symmetry laws. Such a unified principle explains the
availability of a flavour symmetrical mode of neutrino oscillations.Comment: 19 pages, LaTex, Published version in IJT
ContextVP: Fully Context-Aware Video Prediction
Video prediction models based on convolutional networks, recurrent networks,
and their combinations often result in blurry predictions. We identify an
important contributing factor for imprecise predictions that has not been
studied adequately in the literature: blind spots, i.e., lack of access to all
relevant past information for accurately predicting the future. To address this
issue, we introduce a fully context-aware architecture that captures the entire
available past context for each pixel using Parallel Multi-Dimensional LSTM
units and aggregates it using blending units. Our model outperforms a strong
baseline network of 20 recurrent convolutional layers and yields
state-of-the-art performance for next step prediction on three challenging
real-world video datasets: Human 3.6M, Caltech Pedestrian, and UCF-101.
Moreover, it does so with fewer parameters than several recently proposed
models, and does not rely on deep convolutional networks, multi-scale
architectures, separation of background and foreground modeling, motion flow
learning, or adversarial training. These results highlight that full awareness
of past context is of crucial importance for video prediction.Comment: 19 pages. ECCV 2018 oral presentation. Project webpage is at
https://wonmin-byeon.github.io/publication/2018-ecc
Studies of oxide/ZnO near-interfacial defects by photoluminescence and deep level transient spectroscopy
The evolution of near-interfacial defects from Al2 O3 ZnO and MgOZnO upon thermal annealing has been studied by photoluminescence, deep level transient spectroscopy, and secondary ion mass spectroscopy. We find that all the results are strongly connected and that they point to the direction that Zn outdiffuses from ZnO to the oxide layer during annealing and creates deep level defects near the interfacial region. These defects reduce the band-edge emission and increase the deep level emission at 2.37 eV. Our study shows that the oxide/ZnO interface is relatively fragile and caution must be taken for making metal-oxide-ZnO based transistors and light emitting diodes. © 2008 American Institute of Physics.published_or_final_versio
Metrics with Prescribed Ricci Curvature near the Boundary of a Manifold
Suppose is a manifold with boundary. Choose a point . We
investigate the prescribed Ricci curvature equation \Ric(G)=T in a
neighborhood of under natural boundary conditions. The unknown here is
a Riemannian metric. The letter in the right-hand side denotes a
(0,2)-tensor. Our main theorems address the questions of the existence and the
uniqueness of solutions. We explain, among other things, how these theorems may
be used to study rotationally symmetric metrics near the boundary of a solid
torus . The paper concludes with a brief discussion of the Einstein
equation on .Comment: 13 page
Assessing the Potential of Classical Q-learning in General Game Playing
After the recent groundbreaking results of AlphaGo and AlphaZero, we have
seen strong interests in deep reinforcement learning and artificial general
intelligence (AGI) in game playing. However, deep learning is
resource-intensive and the theory is not yet well developed. For small games,
simple classical table-based Q-learning might still be the algorithm of choice.
General Game Playing (GGP) provides a good testbed for reinforcement learning
to research AGI. Q-learning is one of the canonical reinforcement learning
methods, and has been used by (Banerjee Stone, IJCAI 2007) in GGP. In this
paper we implement Q-learning in GGP for three small-board games (Tic-Tac-Toe,
Connect Four, Hex)\footnote{source code: https://github.com/wh1992v/ggp-rl}, to
allow comparison to Banerjee et al.. We find that Q-learning converges to a
high win rate in GGP. For the -greedy strategy, we propose a first
enhancement, the dynamic algorithm. In addition, inspired by (Gelly
Silver, ICML 2007) we combine online search (Monte Carlo Search) to
enhance offline learning, and propose QM-learning for GGP. Both enhancements
improve the performance of classical Q-learning. In this work, GGP allows us to
show, if augmented by appropriate enhancements, that classical table-based
Q-learning can perform well in small games.Comment: arXiv admin note: substantial text overlap with arXiv:1802.0594
Multiplication and Composition in Weighted Modulation Spaces
We study the existence of the product of two weighted modulation spaces. For
this purpose we discuss two different strategies. The more simple one allows
transparent proofs in various situations. However, our second method allows a
closer look onto associated norm inequalities under restrictions in the Fourier
image. This will give us the opportunity to treat the boundedness of
composition operators.Comment: 49 page
Multi-model SAR image despeckling
A multi-model despeckling approach for SAR image is presented. The chi-squared test is used to segment the image into homogeneous and heterogeneous regions. Then, the heterogeneous regions are separated into subregions, each of which consists of the points with same edge orientations. Homogeneous regions and the separated subregions are despeckled according to their characteristics. Experimental results are reported
Concurrent adaptation to opposing visual displacements during an alternating movement.
It has been suggested that, during tasks in which subjects are exposed to a visual rotation of cursor feedback, alternating bimanual adaptation to opposing rotations is as rapid as unimanual adaptation to a single rotation (Bock et al. in Exp Brain Res 162:513â519, 2005). However, that experiment did not test strict alternation of the limbs but short alternate blocks of trials. We have therefore tested adaptation under alternate left/right hand movement with opposing rotations. It was clear that the left and right hand, within the alternating conditions, learnt to adapt to the opposing displacements at a similar rate suggesting that two adaptive states were formed concurrently. We suggest that the separate limbs are used as contextual cues to switch between the relevant adaptive states. However, we found that during online correction the alternating conditions had a significantly slower rate of adaptation in comparison to the unimanual conditions. Control conditions indicate that the results are not directly due the alternation between limbs or to the constant switching of vision between the two eyes. The negative interference may originate from the requirement to dissociate the visual information of these two alternating displacements to allow online control of the two arms
Improved glucose tolerance in acyl CoA:diacylglycerol acyltransferase 1-null mice is dependent on diet
BACKGROUND: Mice that lack acyl CoA:diacylglycerol acyltransferase (Dgat1(-/- )mice) are reported to have a reduced body fat content and improved glucose tolerance and insulin sensitivity. Studies so far have focussed on male null mice fed a high fat diet and there are few data on heterozygotes. We compared male and female Dgat1(-/-), Dgat1(+/- )and Dgat1(+/+ )C57Bl/6 mice fed on either standard chow or a high fat diet. RESULTS: Body fat content was lower in the Dgat1(-/- )than the Dgat1(+/+ )mice in both experiments; lean body mass was higher in male Dgat1(-/- )than Dgat1(+/+ )mice fed on the high fat diet. Energy intake and expenditure were higher in male Dgat1(-/- )than Dgat1(+/+ )mice; these differences were less marked or absent in females. The body fat content of female Dgat1(+/- )mice was intermediate between that of Dgat1(-/- )and Dgat1(+/+ )mice, whereas male Dgat1(+/- )mice were similar to or fatter than Dgat1(+/+ )mice. Glucose tolerance was improved and plasma insulin reduced in Dgat1(-/- )mice fed on the high fat diet, but not on the chow diet. Both male and female Dgat1(+/- )mice had similar glucose tolerance to Dgat1(+/+ )mice. CONCLUSION: These results suggest that although ablation of DGAT1 improves glucose tolerance by preventing obesity in mice fed on a high fat diet, it does not improve glucose tolerance in mice fed on a low fat diet
- âŠ